MERS exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios

Internal identifier: 002A84 (Main/Exploration); previous: 002A83; next: 002A85

Authors: S. Y. Kung [United States, People's Republic of China]; Yuhui Luo [United States]; Man-Wai Mak

Source: Journal of Signal Processing Systems [1939-8018]; 2010, vol. 61, no. 1, pp. 3-20

RBID : ISTEX:DEC7884D0F11AB1F2B7BBF66EAC1290D1D036BB3

English descriptors

Abstract

Abstract: An effective data mining system lies in the representation of pattern vectors. For many bioinformatic applications, data are represented as vectors of extremely high dimension. This motivates the research on feature selection. In the literature, there are plenty of reports on feature selection methods. In terms of training data types, they are divided into the unsupervised and supervised categories. In terms of selection methods, they fall into filter and wrapper categories. This paper will provide a brief overview on the state-of-the-arts feature selection methods on all these categories. Sample applications of these methods for genomic signal processing will be highlighted. This paper also describes a notion of self-supervision. A special method called vector index adaptive SVM (VIA-SVM) is described for selecting features under the self-supervision scenario. Furthermore, the paper makes use of a more powerful symmetric doubly supervised formulation, for which VIA-SVM is particularly useful. Based on several subcellular localization experiments, and microarray time course experiments, the VIA-SVM algorithm when combined with some filter-type metrics appears to deliver a substantial dimension reduction (one-order of magnitude) with only little degradation on accuracy.
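The filter category mentioned in the abstract ranks each feature by a score computed directly from the data, without training a classifier. The sketch below is only an illustration of that idea: it uses a signal-to-noise ratio score, which is an assumption on our part, since the abstract does not say which filter-type metrics the paper combines with VIA-SVM. The function names and the toy data are hypothetical.

```python
from statistics import mean, pstdev


def snr_scores(samples, labels):
    """Signal-to-noise ratio |mu_pos - mu_neg| / (sd_pos + sd_neg) per feature."""
    pos = [x for x, y in zip(samples, labels) if y == 1]
    neg = [x for x, y in zip(samples, labels) if y != 1]
    scores = []
    for j in range(len(samples[0])):
        p = [row[j] for row in pos]
        n = [row[j] for row in neg]
        gap = abs(mean(p) - mean(n))
        spread = pstdev(p) + pstdev(n)
        if spread > 0:
            scores.append(gap / spread)
        else:
            # Feature is constant within both classes.
            scores.append(float("inf") if gap > 0 else 0.0)
    return scores


def select_top_k(samples, labels, k):
    """Indices of the k highest-scoring features, best first."""
    scores = snr_scores(samples, labels)
    order = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return order[:k]


# Toy data, made up for illustration: feature 0 separates the classes
# strongly, feature 1 not at all, feature 2 only weakly.
X = [
    [5.0, 1.0, 2.0],
    [5.2, 3.0, 2.4],
    [4.8, 2.0, 1.8],
    [1.0, 2.5, 1.5],
    [1.2, 1.5, 1.9],
    [0.8, 2.0, 1.3],
]
y = [1, 1, 1, 0, 0, 0]

print(select_top_k(X, y, 2))
```

A wrapper method, by contrast, would score candidate feature subsets by training and evaluating a classifier on each, which is costlier but takes feature interactions into account.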

Url: https://api.istex.fr/ark:/67375/VQC-RV3XCRW0-V/fulltext.pdf
DOI: 10.1007/s11265-008-0273-8



The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios</title>
<author>
<name sortKey="Kung, S Y" sort="Kung, S Y" uniqKey="Kung S" first="S. Y." last="Kung">S. Y. Kung</name>
</author>
<author>
<name sortKey="Luo, Yuhui" sort="Luo, Yuhui" uniqKey="Luo Y" first="Yuhui" last="Luo">Yuhui Luo</name>
</author>
<author>
<name sortKey="Mak, Man Wai" sort="Mak, Man Wai" uniqKey="Mak M" first="Man-Wai" last="Mak">Man-Wai Mak</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:DEC7884D0F11AB1F2B7BBF66EAC1290D1D036BB3</idno>
<date when="2008" year="2008">2008</date>
<idno type="doi">10.1007/s11265-008-0273-8</idno>
<idno type="url">https://api.istex.fr/ark:/67375/VQC-RV3XCRW0-V/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000209</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000209</idno>
<idno type="wicri:Area/Istex/Curation">000209</idno>
<idno type="wicri:Area/Istex/Checkpoint">000798</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000798</idno>
<idno type="wicri:doubleKey">1939-8018:2008:Kung S:feature:selection:for</idno>
<idno type="wicri:Area/Main/Merge">002B10</idno>
<idno type="wicri:Area/Main/Curation">002A84</idno>
<idno type="wicri:Area/Main/Exploration">002A84</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios</title>
<author>
<name sortKey="Kung, S Y" sort="Kung, S Y" uniqKey="Kung S" first="S. Y." last="Kung">S. Y. Kung</name>
<affiliation wicri:level="4">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Princeton University, Princeton, NJ</wicri:regionArea>
<orgName type="university">Université de Princeton</orgName>
<placeName>
<settlement type="city">Princeton (New Jersey)</settlement>
<region type="state">New Jersey</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>National Chung Hsing University, 250 Kuo Kuang Rd., 402, Taichung, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Luo, Yuhui" sort="Luo, Yuhui" uniqKey="Luo Y" first="Yuhui" last="Luo">Yuhui Luo</name>
<affiliation wicri:level="4">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Princeton University, Princeton, NJ</wicri:regionArea>
<orgName type="university">Université de Princeton</orgName>
<placeName>
<settlement type="city">Princeton (New Jersey)</settlement>
<region type="state">New Jersey</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Mak, Man Wai" sort="Mak, Man Wai" uniqKey="Mak M" first="Man-Wai" last="Mak">Man-Wai Mak</name>
<affiliation>
<wicri:noCountry code="subField">SAR</wicri:noCountry>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Journal of Signal Processing Systems</title>
<title level="j" type="sub">for Signal, Image, and Video Technology (formerly the Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology)</title>
<title level="j" type="abbrev">J Sign Process Syst</title>
<idno type="ISSN">1939-8018</idno>
<idno type="eISSN">1939-8115</idno>
<imprint>
<publisher>Springer US; http://www.springer-ny.com</publisher>
<pubPlace>Boston</pubPlace>
<date type="published" when="2010-10-01">2010-10-01</date>
<biblScope unit="volume">61</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="3">3</biblScope>
<biblScope unit="page" to="20">20</biblScope>
</imprint>
<idno type="ISSN">1939-8018</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">1939-8018</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Feature selection</term>
<term>Filter</term>
<term>Genomics</term>
<term>Microarray</term>
<term>Self-supervised</term>
<term>Sequence</term>
<term>Supervised</term>
<term>Unsupervised</term>
<term>Wrapper</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: An effective data mining system lies in the representation of pattern vectors. For many bioinformatic applications, data are represented as vectors of extremely high dimension. This motivates the research on feature selection. In the literature, there are plenty of reports on feature selection methods. In terms of training data types, they are divided into the unsupervised and supervised categories. In terms of selection methods, they fall into filter and wrapper categories. This paper will provide a brief overview on the state-of-the-arts feature selection methods on all these categories. Sample applications of these methods for genomic signal processing will be highlighted. This paper also describes a notion of self-supervision. A special method called vector index adaptive SVM (VIA-SVM) is described for selecting features under the self-supervision scenario. Furthermore, the paper makes use of a more powerful symmetric doubly supervised formulation, for which VIA-SVM is particularly useful. Based on several subcellular localization experiments, and microarray time course experiments, the VIA-SVM algorithm when combined with some filter-type metrics appears to deliver a substantial dimension reduction (one-order of magnitude) with only little degradation on accuracy.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>République populaire de Chine</li>
<li>États-Unis</li>
</country>
<region>
<li>New Jersey</li>
</region>
<settlement>
<li>Princeton (New Jersey)</li>
</settlement>
<orgName>
<li>Université de Princeton</li>
</orgName>
</list>
<tree>
<noCountry>
<name sortKey="Mak, Man Wai" sort="Mak, Man Wai" uniqKey="Mak M" first="Man-Wai" last="Mak">Man-Wai Mak</name>
</noCountry>
<country name="États-Unis">
<region name="New Jersey">
<name sortKey="Kung, S Y" sort="Kung, S Y" uniqKey="Kung S" first="S. Y." last="Kung">S. Y. Kung</name>
</region>
<name sortKey="Luo, Yuhui" sort="Luo, Yuhui" uniqKey="Luo Y" first="Yuhui" last="Luo">Yuhui Luo</name>
<name sortKey="Luo, Yuhui" sort="Luo, Yuhui" uniqKey="Luo Y" first="Yuhui" last="Luo">Yuhui Luo</name>
</country>
<country name="République populaire de Chine">
<noRegion>
<name sortKey="Kung, S Y" sort="Kung, S Y" uniqKey="Kung S" first="S. Y." last="Kung">S. Y. Kung</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
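As a rough illustration of how the record above could be read outside Dilib, the following Python sketch extracts the title, authors, DOI and keywords with the standard library. One point needs care: the record uses the `wicri:` prefix without declaring it, so a strict XML parser rejects it as written. The sketch binds the prefix to a placeholder URI (`urn:example:wicri`, an assumption chosen only to make parsing possible, not an official namespace). The function name `parse_record` and the shortened sample are hypothetical.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URI (assumption): the record never declares "wicri:".
WICRI_NS = "urn:example:wicri"


def parse_record(xml_text):
    """Return title, authors, DOI and keywords from a Wicri <record> string."""
    # Bind the undeclared prefix on the root element before parsing.
    patched = xml_text.replace(
        "<record>", '<record xmlns:wicri="%s">' % WICRI_NS, 1
    )
    root = ET.fromstring(patched)
    title_stmt = root.find(".//titleStmt")
    return {
        "title": title_stmt.findtext("title"),
        "authors": [n.text for n in title_stmt.findall("author/name")],
        "doi": root.findtext(".//publicationStmt/idno[@type='doi']"),
        "keywords": [t.text for t in root.findall(".//keywords/term")],
    }


# Shortened copy of the record, keeping only the fields read above.
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios</title>
<author><name>S. Y. Kung</name></author>
<author><name>Yuhui Luo</name></author>
<author><name>Man-Wai Mak</name></author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="doi">10.1007/s11265-008-0273-8</idno>
</publicationStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Feature selection</term>
<term>Filter</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
</TEI>
</record>"""

record = parse_record(SAMPLE)
print(record["title"])
print("; ".join(record["authors"]))
print(record["doi"])
```

The same function should work on the full record, since it only relies on element names that appear in it.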

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002A84 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002A84 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:DEC7884D0F11AB1F2B7BBF66EAC1290D1D036BB3
   |texte=   Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021